In Defense of C4.5: Notes on Learning One-Level Decision Trees
Abstract
To appear in W. Cohen & H. Hirsh (eds.), Machine Learning: Proceedings of the Eleventh International Conference (New Brunswick, NJ, July 1994), Morgan Kaufmann, San Francisco, CA.

We discuss the implications of Holte's recently published article, which demonstrated that on the most commonly used datasets very simple classification rules are almost as accurate as the decision trees produced by Quinlan's C4.5. We consider, in particular, the significance of Holte's results for the future of top-down induction of decision trees. To an extent, Holte questioned the value of further research on multilevel decision tree learning. We go through all parts of Holte's study in detail and try to put the results into perspective. We argue that the difference in accuracy between 1R and C4.5 witnessed by Holte, though small in absolute terms, is still significant. We claim that C4.5 possesses additional accuracy-related advantages over 1R. In addition, we discuss the representativeness of the databases used by Holte. We compare empirically the optimal accuracies of multilevel and one-level decision trees and observe some significant differences. Finally, we point out several deficiencies of limited-complexity classifiers.
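The 1R learner compared against C4.5 above can be summarized briefly: for each attribute, build a one-level rule that maps each observed attribute value to the majority class among the training examples with that value, then keep the single attribute whose rule misclassifies the fewest training examples. A minimal sketch follows; the toy attribute names and values are hypothetical, and details such as Holte's handling of continuous attributes and missing values are omitted.

```python
from collections import Counter, defaultdict

def one_rule(examples, labels):
    """Learn a 1R-style one-level classifier: per attribute, assign the
    majority class to each observed value; keep the attribute whose rule
    makes the fewest training errors. Sketch only (nominal attributes)."""
    best_attr, best_rule, best_errors = None, None, len(labels) + 1
    for a in range(len(examples[0])):
        # Count class frequencies per value of attribute a.
        counts = defaultdict(Counter)
        for x, y in zip(examples, labels):
            counts[x[a]][y] += 1
        # Majority class per value; errors = examples not in the majority.
        rule = {v: c.most_common(1)[0][0] for v, c in counts.items()}
        errors = sum(sum(c.values()) - max(c.values()) for c in counts.values())
        if errors < best_errors:
            best_attr, best_rule, best_errors = a, rule, errors
    return best_attr, best_rule

# Hypothetical toy data: each example is (outlook, windy).
X = [("sunny", "yes"), ("sunny", "no"), ("rain", "yes"), ("rain", "no")]
y = ["play", "play", "stay", "stay"]
attr, rule = one_rule(X, y)  # picks the attribute with the fewest errors
```

On this toy data the first attribute separates the classes perfectly, so the learned rule tests it alone, which is exactly the one-level behavior whose accuracy Holte measured.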
Similar resources
In Defense of C4.5: Notes on Learning One-Level Decision Trees
We discuss the implications of Holte's recently published article, which demonstrated that on the most commonly used datasets very simple classification rules are almost as accurate as the decision trees produced by Quinlan's C4.5. We consider, in particular, the significance of Holte's results for the future of top-down induction of decision trees. To an extent, Holte questioned the sense of fu...
Full text

Classification of Two-Class Data with Hyperrectangles Parallel to the Coordinate Axes
One of the machine learning tasks is supervised learning. In supervised learning we infer a function from labeled training data. The goal of supervised learning algorithms is to learn a good hypothesis that minimizes the sum of the errors. A wide range of supervised algorithms is available, such as decision trees, SVMs, and KNN methods. In this paper we focus on decision tree algorithms. When we ...
متن کاملA comparison of stacking with MDTs to bagging, boosting, and other stacking methods
In this paper, we present an integration of the algorithm MLC4.5 for learning meta decision trees (MDTs) into the Weka data mining suite. MDTs are a method for combining multiple classifiers. Instead of giving a prediction, MDT leaves specify which classifier should be used to obtain a prediction. The algorithm is based on the C4.5 algorithm for learning ordinary decision trees. An extensive pe...
Full text

Theory and Applications of Agnostic PAC-Learning with Small Decision Trees
We exhibit a theoretically founded algorithm T2 for agnostic PAC-learning of decision trees of at most 2 levels, whose computation time is almost linear in the size of the training set. We evaluate the performance of this learning algorithm T2 on 15 common “real-world” datasets, and show that for most of these datasets T2 provides simple decision trees with little or no loss in predictive power...
Full text

Univariate Decision Tree Induction using Maximum Margin Classification
In many pattern recognition applications, decision trees are the first choice due to their simplicity and easily interpretable nature. In this paper, we propose a new decision tree learning algorithm called the univariate margin tree, where, for each continuous attribute, the best split is found using convex optimization. Our simulation results on 47 data sets show that the novel margin tree classifier pe...
Full text
Publication date: 1994